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(54) Method for editing a subband encoded audio signal 



(57) A digital signal of which input data has been 
segmented as block each having a predetermined data 
amount and highly efficiently encoded along with an 
adjacent block is decoded, edited, and then highly effi- 
ciently encoded. A delay that takes place in such signal 



processes is compensated. Thus, part of a digital signal 
that has been highly efficiently encoded digital signal 
can be edited. 
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Description 

BACKGROUND OF THE INVENTION 
Field Qf the Invention 

[0001] The present invention relates to a signal 
processing apparatus and a signal processing method 
that allow editing a part of a digital signal that has been 
segmented as blocks each of which has a predeter- 
mined data amount and each block to be highly effi- 
ciently encoded along with an adjacent block. 

Description of the Related Art 

[0002] As a related art reference of a highly effi- 
ciently encoding method for an audio signal, for exam- 
ple, a transform encoding method is known. The 
transform encoding method is one example of a block- 
segmentation frequency band dividing method. In the 
transform encoding method, a time-base audio signal is 
segmented into blocks at intervals of a predetermined 
unit time period. The time-base signal of each block is 
converted into a frequency-base signal (namely, orthog- 
onally transformed). Thus, the time-base signal Is 
divided into a plurality of frequency bands. In each fre- 
quency band, blocks are encoded. As another related 
art reference, a sub band coding (SBC) method as an 
example of a non-block-segmentation frequency band 
dividing method is known. In the SBC method, a time- 
base audio signal is divided Into a plurality of frequency 
bands and then encoded without segmenting the signal 
into blocks at intervals of a predetemriined unit time 
period. 

[0003] As another related art reference, a highly 
efficiently encoding method that is a combination of the 
band division encoding method and the SBC method is 
also known. In this highly efficiently encoding method, a 
signal of each sub band is orthogonally transformed into 
a frequency-base signal con^esponding to the transform 
encoding method. The transformed signal is encoded in 
each sub band. 

[0004] As an example of a band dividing filter used 
for the above-described sub band coding method, for 
example a QMF (Quadrature Miror Filter) is known. 
The QMF is described in for example R. E. Crochiere 
"Digital coding of speech in sub bands" Bell Syst. Tech. 
J. Vol. 56. No. 8 (1976). An equal band width filter divid- 
ing method for a poly-phase quadrature filter and an 
apparatus thereof are described in ICASSP 83, BOS- 
TON "Polyphase Quadrature filters - A new sub band 
coding technique", Joseph H. Rothwiler. 
[0005] As an example of the orthogonal transfomn 
method, an input audio signal is segmented into blocks 
at intervals of a predetemriined unit time period (for each 
frame). Each block is transformed by for example a fast 
Fourier transforming (FFT) method, a discrete cosine 
transforming (DCT) method, or a modified OCT trans- 



fomiing (MDCT) method. As a result, a time-base signal 
is converted into a frequency-base signal. The MDCT is 
described in for example ICASSP 1987, "Sub 
band/Transform coding Using Rlter Bank Designs 
5 Based on Time Domain Aliasing Cancellation". J. P. 
Princen and A. B. Bradley, Univ. of Sun-ey Royal Mel- 
bourne Inst of Tech. 

[0006] On the other hand, an encoding method that 
uses a frequency division width In consideration of the 

10 hearing characteristics of humans for quantizing each 
sub band frequency component is known. In other 
words, so-called critical bands of which their band 
widths are proportional to their frequencies have been 
widely used. With the critical bands, an audio signal 

15 may be divided into a plurality of sub bands (for exam- 
ple, 25 sub bands). According to such a sub band cod- 
ing method, when data of each sub band is encoded, a 
predetemriined number of bits is allocated for each sub 
band. Alternatively, an adaptive number of bits is allo- 

20 cated for each sub band. For example, when MDCT 
coefficient data generated by the MDCT process is 
encoded with the above-described bit allocating 
method, an adaptive number of bits is allocated to the 
MDCT coefficient data of each block of each sub band. 

25 With the allocated bits, each block is encoded. 

[0007] An example of a related art reference of 
such a bit allocating method and an apparatus con-e- 
sponding thereto is described as "a method for allocat- 
ing bits conresponding to the strength of a signal of each 

30 sub band" in IEEE Transactions of Acoustics, Speech, 
and Signal Processing, vol. ASSP-25, NO. 4, August 
(1977). As another related art reference, "a method for 
fixedly allocating bits con-esponding to a signal to noise 
ratio for each sub band using a masking of the sense of 

35 hearing" is described in ICASP, 1 980, The critical band 
coder - digital encoding of the perceptual requirements 
of the auditory system", M. A. Kransner Mlt. 
[0008] When each block is encoded for each sub 
band, each block is nomnaiized and quantized for each 

40 sub band. Thus, each block is effectively encoded. This 
process is referred to as block floating process. When 
MDCT coefficient data generated by the MDCT process 
is encoded, the maximum value of the absolute values 
of the MDCT coefficients is obtained for each sub band. 

45 Con'esponding to the maximum value, the MDCT coef- 
ficient data is nomnaiized and then quantized. Thus, the 
MDCT coefficient data can be more effectively encoded. 
The nomrializing process can be performed as follows. 
From a plurality of numbered values, a value used for 

50 the normalizing process is selected for each block using 
a predetermined calculating process. The number 
assigned to the selected value is used as nonnatizatlon 
information. The plurality of values are numbered so 
that they increment by 2 dB of an audio level. 

55 [0009] The above-described highly effectively 
encoded signal is decoded as follows. With reference to 
the bit allocation information, the normalization infomia- 
tion, and so forth for each sub band, MDCT coefficient 
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'data is 'generated corresponding to a signal that has 
been highly efficiently encoded. Since a so-called 
Inversely orthogonally transforming process is per- 
formed corresponding to the MDCT coefficient data, 
time-base data is generated. When the highly efficiently 5 
encoding process is performed, if the frequency band is 
divided into sub bands by a band dividing filter, the time- 
base data Is combined using a sub band combining fil- 
ter. 

[0010] When normalization infomiation Is changed 10 
by an adding process, a subtracting process, or the like, 
a reproduction level adjusting function, a filtering func- 
tion, and so forth can be accomplished for a time-base 
signal of which an encoded data has been decoded that 
is known as the editing method of data. According to 15 
this method, since the reproduction level can be 
adjusted by a calculating process such as an adding 
process or a subtracting process, the structure of the 
apparatus becomes simple. In addition, since a decod- 
ing process, an encoding process, and so forth are not 20 
excessively required, the reproduction level can be 
adjusted without a deterioration of the signal quality. In 
addition, in this method, an encoded signal can be mod- 
ified without changing the time period of the generated 
signal by decoding, part of the signal generated by the 25 
decoding process can be changed with no influence 
from other parts. 

[0011] In other than the method for changing nor- 
malization information, when the chronological relation 
between a decoded signal and an original signal so 
(namely, a delay amount of phases) Is obtained, 
encoded data that has the same chronological relation 
with a decoded signal can be generated. 
[0012] When encoded data is changed in the 
above-described method, an editing operation such as 35 
a level adjustment can be peifomied con^esponding to 
an Increase or decrease of one value of normalization 
Information (for example, 2 dB). Thus, such a level 
adjustment cannot be more precisely performed. In the 
chronological direction, an editing operation such as a 40 
level adjustment cannot be performed in the accuracy 
exceeding the minimum time unit corresponding to the 
encoding data fomnat of the applied encoding method 
(the minimum time unit is for example, 1 frame). 
[001 3] Thus, due to such restrictions corresponding 45 
to the applied encoding method and encoding data for- 
mat, the editing operations in the reproduction level and 
the frequency region and the editing operation in the 
chronological direction cannot be more accurately per- 
formed, so 

OBJECTS AND SUMMARY OF THE INVENITION 

[001 4] Therefore, an object of the present invention 
is to provide a digital signal processing apparatus, a dig- 55 
ital signal processing method, a digital signal receding 
apparatus, and a digital signal recording method that 
allow an editing process for such as a reproducing level 



that is less affected by an applied encoding format to be 
perfomned. Another object of the present invention is to 
provide a record medium on which such data is 
recorded. 

[0015] A first aspect of the present invention is a 
digital signal processing apparatus for processing an 
input digital signal that has been segmented as blocks 
each having a predetermined data amount and highly 
efficiently encoded along with adjacent blocks, compris- 
ing a decoding means for decoding the highly efficiently 
encoded digital signal along with adjacent blocks, a 
changing process means for changing the decoded dig- 
ital signal, an encoding means for highly efficiently 
encoding the changed digital signal along with adjacent 
blocks, and a delay compensating means for compen- 
sating a delay of the decoded signal decoded by the 
decoding means. 

[0016] A second aspect of the present Invention is a 
digital signal processing method for processing an input 
digital signal that has been segmented as blocks each 
having a predetermined data amount and highly effi- 
ciently encoded along with adjacent blocks, comprising 
the steps of (a) decoding the highly efficiently encoded 
digital signal along with adjacent blocks, (b) changing 
the decoded digital signal, and (c) highly efficiently 
encoding the changed digital signal along with adjacent 
blocks and compensating a delay of the decoded signal 
decoded at step (a). 

[0017] These and other objects, features and 
advantages of the present invention will become more 
apparent in light of the following detailed description of 
a best mode embodiment thereof, as illustrated in the 
accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0018] 

Rg. 1 is a block diagram showing an example of the 
structure of a digital signal recording apparatus 
according to the present invention; 
Fig. 2A is a schematic diagram for explaining an 
orthogonal transform block size in the case that a 
supplied signal is semi-regular; 
Rg. 2B is a schematic diagram for explaining an 
orthogonal transform block size of short mode In 
the case that a supplied signal is non-regular; 
Rg. 20 is a schematic diagram for explaining an 
orthogonal transfonn block size of middle mode-a 
in the case that a supplied signal is non-regular; 
Rg. 2D is a schematic diagram for explaining an 
orthogonal transfonn block size of middle mode-b 
in the case that a supplied signal is non-regular; 
Rg. 3 is a schematic diagram showing an example 
of an encoding data format according to the present 
invention; 

Rg. 4 Is a schenratic diagram showing details of 
data of the first byte of Rg. 3; 
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* Fig, 5 is'a block diagram showing an example of the 
structure of a bit allocation calculating circuit; 
Fig. 6 is a graph showing an example of a spectrum 
of frequency bands divided corresponding to a crit- 
ical band, a block floating, and so forth; 
Rg. 7 is a graph showing an example of a masking 
spectrum; 

Rg. 8 Is a graph for explaining a combination of a 
minimum audible curve and a masking spectrum; 
Rg. 9 is a block diagram showing an example of the 
structure of a digital signal reproducing and/or 
recording apparatus according to the present inven- 
tion; 

Rg. 1 0 is a schematic diagram for explaining a gen- 
eration of normalization information; 
Rg. 1 1 is a schematic diagram for explaining a level 
operation by changing normalization information; 
Rg. 12 is a schematic diagram for explaining a fil- 
tering operation by changing normalization Informa- 
tion; 

Rg. 13 is a schematic diagram for explaining an 
overlap of frames of encoded data; 
Rg. 14 is a block diagram showing an example of 
the structure for performing an editing process 
according to the present invention; 
Rg. 15A is a schematte diagram showing the rela- 
tion between a signal waveform and frames 
recorded on a record medium; 
Rg. 15B is a schematic diagram showing the rela- 
tion between a signal waveform and frames of 
which a decoding process and an effect process 
have been performed; 

Rg. 15C is a schematic diagram showing the rela- 
tion between a signal waveform and frames of 
which an encoding process has been performed; 
Rg. 1 6 is a schematic diagram for explaining an 
example of the chronological relation of individual 
frames In the editing process according to the 
present invention; 

Rg. 17A is a schematic diagram showing the case 
that input PCM data that is filtered with windows 
and encoded for each frame; 
Fig. 17B is a schematic diagram showing the case 
that part of the PCM data that has been encoded as 
shown in Rg. 17A and recorded on a record 
medium is edited; 

Rg. 17C is a schematic diagram showing the case 
that filtering positions of the windows are compen- 
sated for a delay compensation amount; and 
Rg. 18 is a schematic diagram showing an 
encoded data format corresponding to the MPEG 
audio format. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[0019] Next, with reference to Fig. 1, an example of 
the structure of a digital signal recording apparatus 



according to the present invention will be described. An 
embodiment of the present invention is a digital signal 
recording apparatus having an encoding process sys- 
tem for perfonning a highly efficient encoding process 

5 for an input digital signal such as an audio PCM (Pulse 
Code Modulation) signal conesponding to sub band 
coding (SBC) process, adaptive transform coding (ATC) 
process, and adaptive bit allocating process. In this 
example, as an input digital signal, a digital audio data 

70 signal of which an audio signal (of a speech of a person, 
a singing voice of a person, an instrumental sound, or 
the like is digitized), a digital video signal, or the like can 
be handled. 

[0020] When the sampling frequency is 44.1 kHz, 
15 an audio PCM signal with a frequency band of 0 to 2 
kHz is supplied to a band dividing filter 101 through an 
input terminal 100. The band dividing filter 101 divides 
the supplied signal into a signal with a sub band of 0 to 
1 1 kHz and a signal with a sub band of 1 1 kHz to 22 
20 kHz. The signal with the sub band of 11 to 22 kHz is 
supplied to an MDCT (Modified Discrete Cosine Trans- 
fonm) circuit 103 and block designating circuits 109, 
110, and 111. 

[0021] The signal with the sub band of 0 kHz to 1 1 

25 kHz is supplied to a band dividing filter 102. The band 
dividing filter 102 divides the supplied signal into a sig- 
nal with a sub band of 5.5 kHz to 1 1 kHz and a signal 
with a sub band of 0 to 5.5 kHz. The signal with the sub 
band of 5.5 to 1 1 kHz is supplied to an MDCT circuit 1 04 

30 and the block designating circuits 109, 110, and 111. 
On the other hand, the signal with the sub band of 0 to 
5.5 kHz is supplied to an MDCT circuit 105 and the 
block designating circuits 109, 110, and 111. Each of 
the band dividing filters 101 and 102 can be composed 

35 of a QFM filter or the like. The block designating circuit 
1 09 designates the block size corresponding to the sup- 
plied signal. Information that represents the designated 
block size is supplied to the MDCT circuit 103 and an 
output tenminal 113. 

40 [0022] The block designating circuit 1 1 0 designates 
the block size corresponding to the supplied signal. 
Information that represents the designated block size is 
supplied to the MDCT circuit 1 04 and an output tenminal 
115. The block designating circuit 111 designates the 

45 block size corresponding to the supplied signal. Infor- 
mation that represents the designated block size is sup- 
plied to the MDCT circuit 105 and an output terminal 
117. The block designating circuits 109. 110, and 111 
cause the block size or the block length to be adaptively 

50 Changed con'esponding to the input data before the 
orthogonally transforming process is performed. 
[0023] Rgs. 2A, 2B, 2C, and 2D show examples of 
data of individual sub bands supplied to the MDCT cir- 
cuits 103, 104, and 105. The block designating circuits 

55 1 09, 1 1 0, and 1 1 1 independently designate the sizes of 
orthogonally transformed blocks of individual sub bands 
that are output from the band dividing filters 101 and 
102. In addition, the MDCT circuits 103, 104, and 105 
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tan change time resolutions corresponding to time 
characteristics and frequency distributions of the sig- 
nals. When the input signal is chronotogicaliy semi- 
steady, a long mode of which the size of each orthogo- 
nally transformed block is for example 1 1 .6 ms is used. 5 
[0024] On the other hand, when the input signal is 
non-steady, one of modes of which the size of each 
orthogonally transformed block Is 1/2 or 1/4 of the size 
of each orthogonally transformed block of the long 
mode is used. In reality, in a short mode, the size of 10 
each orthogonally transformed block Is 1/4 of the size of 
each orthogonally transfomried block of the long mode. 
Thus, In the short mode, the size of each orthogonally 
transformed block is 2.9 ms as shown in Fig. 2B. There 
are two middle modes that are a middle mode a and a is 
middle mode b. In the middle mode a, the size of one 
orthogonally transformed block is 1/2 of the size of each 
orthogonally transformed block of the long mode and 
the size of another orthogonally transformed block Is 1/4 
of the size of each orthogonally transformed block of the 20 
long mode. Thus, in the middle mode a, the size of one 
orthogonally transformed block is 5.8 ms and the size of 
another orthogonally transformed block Is 2.9 ms as 
shown in Fig. 2C, In the middle mode b, the size of one 
orthogonally transformed block is 1/4 of the size of each 25 
orthogonally transformed block of the long mode and 
the size of another orthogonally transformed block is 1 /2 
of the size of each orthogonally transfonned block of the 
long block. Thus, in the middle mode b, the ^ze of one 
orthogonally transfonned block is 2.9 ms and the size of 30 
another orthogonally transfomried block is 5.8 ms as 
shown in Fig. 2D. With such various time resolutions, 
complicated Input signals can be handled. 
[0025] To consider a limitation caused from the cir- 
cuit scale of the apparatus and or the like, size of each as 
orthogonally transformed block can be divided In more 
complicated manners. Thus, It Is clear that real input 
signals can be more adequately processed. The block 
size is designated by the block designating circuits 109, 
110, and 111. Information that represents the desig- 40 
nated block size is supplied to the MDCT circuits 103, 
1 04, and 1 05, a bit allocation calculating circuit 1 1 8, and 
the output temninals 1 13, 1 15, and 117. 
[0026] Returning to Fig. 1, the MDCT circuit 103 
performs the MDCT process corresponding to the block 4S 
size designated by the block designating circuit 109. 
High band MDCT coefficient data or frequency-base 
spectrum data that is generated by such a process is 
combined for each critical band and supplied to the 
adaptive bit allocation encoding circuit 106 and the bit 50 
allocation calculating circuit 1 1 8. The MDCT circuit 1 04 
perfonms the MDCT process corresponding to the block 
size designated by the block designating circuit 110. 
Middle band MDCT coefficient data or frequency-base 
spectrum data generated by such a process is supplied 55 
to the adaptive bit allocation encoding circuit 107 and 
the bit allocation calculating circuit 1 1 8 after the critical 
band width thereof is divided in consideration of the 



effectiveness of the block floating process. 
[0027] The MDCT circuit 105 perfomrrs the MDCT 
process corresponding to the block size designated by 
the block designating circuit 111, As the result of the 
process, low band MDCT coefficient data or frequency- 
base spectrum data is combined for each critical band 
and then supplied to the adaptive bit allocation encod- 
ing circuit 108 and the bit allocation calculating circuit 
118. The critical bands are frequency bands that are 
divided in consideration of the hearing characteristics of 
humans. When a particular pure sound is masked with 
a narrow band noise that has the same strength thereof 
and that is in the vicinity of the frequency band of the 
pure sound, the band of the narrow band noise is a crit- 
ical band. The band widths of the critical bands are pro- 
portional to their frequencies. The frequency band of 0 
to 22 kHz is divided into for example 25 critical bands. 
[0028] The bit allocation calculating circuit 1 1 8 cal- 
culates for example the masking amount, energy, and/or 
peak value for each sub band in consideration of the 
above-described critical bands and block floating for a 
masking effect (that will be described later) correspond- 
ing to the supplied MDCT coefficient data or frequency- 
base spectrum data and block size information. Corre- 
sponding to the calculated results, the bit allocation cal- 
culating circuit 118 calculates the scale factor, and the 
number of allocated bits for each sub band. The calcu- 
lated number of allocated bits is supplied to the adaptive 
bit allocation encoding circuits 1 06, 1 07, and 1 08. In the 
following description, each sub band as a bit allocation 
unit is refen-ed to as unit block. 
[0029] The adaptive bit allocation encoding circuit 
106 re-quantizes the spectrum data or MDGT coeffi- 
cient data supplied from the MDCT circuit 103 corre- 
sponding to the block size infomiation supplied from the 
block designating circuit 109 and to the number of allo- 
cated bits and the scale factor information supplied from 
the bit allocation calculating circuit 118. As the result of 
the process, the adaptive bit allocation encoding circuit 
106 generates encoded data corresponding to the 
applied encoding format. The encoded data is supplied 
to a calculating device 120. The adaptive bit allocation 
encoding circuit 107 re-quantizes the spectrum data or 
MDCT coefficient data supplied from the MDCT circuit 
104 corresponding to the block size information sup- 
plied from the bbck designating circuit 110 and to the 
number of allocated bits and scale factor information 
supplied from the bit allocation calculating circuit 118. 
As the result of the process, encoded data correspond- 
ing to the applied encoding fonnat is generated. The 
encoded data is supplied to a calculating device 121 . 
[0030] The adaptive bit allocation encoding circuit 
108 re-quantizes the spectrum data or MDCT coeffi- 
cient data supplied from the MDCT circuit 1 05 con-e- 
sponding to the block size information supplied from the 
block designating circuit 110 and to the number of allo- 
cated bits and scale ^or information supplied from the 
bit allocation calculating circuit 1 1 8. As the result of the 
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process, encoded data corresponding to the applied 
encoding format is generated. The encoded data is sup- 
plied to a calculating device 122. 
[0031] Fig. 3 shows an example of the format of 
encoded data. In Fig. 3, numeric values 0. 1 , 2, 21 1 5 
on the left side represent bytes. In this example, one 
frame is composed of 212 bytes. At the 0-th byte posi- 
tion, block size infomiatlon of each sub band designated 
by the block designating circuits 109, 110, and 111 
shown In Fig. 1 is placed. At the first byte position, infor- 
mation that represents the number of unit blocks is 
placed. In the high band, the probability of which no bits 
are allocated to unit blocks by the bit allocation calculat- 
ing circuit 118 and thereby they are not recorded 
becomes high. Thus, to deal with such a situation, the 
number of unit blocks is designated in such a manner 
that more bits are allocated to the middle band region 
and the low band region that largely affect the sense of 
hearing than the high band region. In addition, at the 
first byte position, the number of unit blocks in which bit 
allocation information is dually written and the number 
of unit blocks in which scale factor infonnatlon is dually 
written are placed. 

[0032] To connect an en'or, the same information is 
dually written. In other words, data recorded at a partic- 
ular byte is dually recorded to another byte. Although 
the strength against an error is proportional to the 
amount of data that is dually written, the amount of data 
used for spectrum data decreases. In the example of 
the encoding format, since the number of unit blocks in 
which bit allocation information is dually written and the 
number of unit blocks in which scale factor information 
is dually written are independently designated, the 
strength against an error and the number of bits used 
for spectrum data can be optimized. The relation 
between a code in a predetermined bit and the nunnber 
of unit blocks has been defined as a format. 
[0033] Rg. 4 shows an example of contents of eight 
bits of the first byte. In this example, the first three bits 
represent the number of contained unit blocks. The next 
two bits represent the number of unit blocks to which the 
bit allocation information is dually written. The last three 
bits represent the number of unit blocks unit blocks to 
which the scale factor information is dually written. 
[0034] At the second byte position shown in Fig. 3, 
the bit allocation information of each unit block is 
placed. One unit block is composed of for example four 
bites. Thus, the bit allocation infonmation for the number 
of unit blocks starting with 0-th unit block is placed. The 
bit allocation infomnation is followed by scale factor 
infonnation of each unit block. For the scale factor infor- 
mation, each unit block is composed of for example six 
bits. Thus, the scale factor information for the number of 
unit blocks starting with the 0-th unit block is placed. 
[0035] The scale factor infonnation is followed by 
spectrum data of each unit block. The spectrum data for 
the number of unit blocks that are really contained is 
placed. Since the data amount of spectrum data con- 



tained in each unit block has been defined as a format, 
with the bit allocation information, the relation of data 
can be obtained. When the number of bits allocated to a 
particular unit block is zero, the unit block Is not con- 
tained. 

[0036] The spectrum information is followed by the 
scale factor that is dually written and the bit allocation 
Infonnation that is dually written. The scale factor infor- 
mation and the bit allocation infonnation are dually writ- 
ten corresponding to the dual write information shown in 
Fig. 4. At the last byte (21 1-st byte) and the second last 
byte (210-th byte), information at the 0-th byte and infor- 
mation at the first byte are dually written. The two bytes 
in which such information is dually written has been 
defined as a fomiat. However, scale factor information 
that is dually written and the bit allocation infonnation 
that is dually written cannot be changed. 
[0037] One frame contains 1 024 PCM samples that 
are supplied through the input terminal 100. The first 
512 samples are used in the immediately preceding 
frame. The last 512 samples are used in the immedi- 
ately following frame. This arrangement is used from a 
view point of an overlap of the MDCT process. 
[0038] Returning to Fig. 1 , a normalization informa- 
tion changing circuit 1 19 generates values for changing 
scale factor information for a low band, a middle band, 
and a high band and supplies the values corresponding 
to the low band, the middle band, and the high band to 
the calculating devices 120, 121, and 122, respectively. 
The calculating device 120 adds the value supplied from 
the nonnalization information changing circuit 119 to 
the scale factor information contained in the encoded 
data supplied from the adaptive bit allocation encoding 
circuit 1 06. When the value that is output from the nor- 
malization information changing circuit 119 is negative, 
the calculating device 120 operates as a subtracting 
device. The calculating device 121 adds the value sup- 
piled from the normalization information changing cir- 
cuit 1 19 to the scale factor information contained in the 
encoded data supplied from the adaptive bit allocation 
encoding circuit 107. When the value that Is output from 
the normalization infonnation changing circuit 119 is 
negative, the calculating device 121 operates as a sub- 
tracting device. 

[0039] The calculating devk^e 122 adds the value 
supplied from the normalization information changing 
circuit 119 to the scale factor information contained in 
the encoded data supplied from the adaptive bit alloca- 
tion encoding circuit 108. When the value that is output 
from the normalization information changing circuit 119 
is negative, the calculating device 122 operates as a 
subtracting devk:e. The normalization infonnation 
changing circuit 1 1 9 operates corresponding to an oper- 
ation of the user through for example an operation 
panel. In this case, the level adjusting process, the filter- 
ing process, and so forth will be described later that the 
user desires are accomplished. Output signals of the 
calculating devices 120, 121, and 122 are supplied to a 
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conventfonal recording system (not shown) through out- 
put terminals 112, 114, and 116, respectively. The 
recording system records the output signals of the cal- 
culating devices 120, 121, and 122 to a record medium 
such as a magneto optical disc. . 5 
[0040] The recording system records at least one 
type of encoded data generated by properly controlling 
addresses of tracks formed on the record medium along 
with data that has not been processed in such a manner 
that the encoded data and non-processed data are sep- 10 
arateiy recorded. This process wiil be described later. 
Thus, at least one type of encoded data and/or pre- 
edited data are recorded on the record medium. As a 
record medium, besides a magneto optical disc, a disc 
shaped record medium (such as a magnetic disc), a is 
tape shaped record medium (such as a magnetic tape 
or an optical take), or a semiconductor memory (such 
as an IC memory, a card type memory, a memory card, 
or an optical memory) may be used. 
[0041] Next, each process will be described in 20 
detail. Fig. 5 shows an example of the structure of the 
bit allocation calculating circuit 118. Frequency-base 
spectrum data or MDCT coefficients supplied from the 
MDCT circuits 103, 104, and 105 through an input ter- 
minal 301 is supplied to an energy calculating circuit 25 
302. In addition, block size information is supplied from 
the block designating circuits 109, 11 0, and 1 1 1 through 
the input terminal 301 to the energy calculating circuit 
302.. The energy calculating circuit 302 calculates the 
sum of the amplitude values of each unit block so as to 30 
calculate the energy of each unit block. 
[0042] . Rg. 6 shows an example of an output signal 
of the energy calculating circuit 302. In Fig. 6, a spec- 
trum SB of the sum of each sub band is represented by 
a vertical line with a circle. In Fig. 6, the horizontal axis 35 
and the vertical axis represent the frequency and signal 
strength, respectively. For simplicity, in Fig. 6, only a 
spectrum B12 is denoted by "SB". The number of sub 
bands (unit blocks) is 12 (B1 to 812). Instead of the 
energy calculating circuit 302, a structural portion that 40 
calculates the peak value, average value, and so forth of 
amplitude values and performs a bit allocating process 
con^esponding to the peak value, average value, and so 
forth of the amplitude values may be disposed. 
[0043] The energy calculating circuit 302 desig- 4s 
nates a scale factor value. In reality, several positive val- 
ues are provided as alternatives of a scale factor value. 
Among them, values that are larger than the maximum 
value of absolute values of spectrum data or MDCT 
coefficients of each unit block are selected. The mini- so 
mum value of the selected values is used as a scale fac- 
tor value of the unit block. Numbers are allocated to the 
alternatives of a scale factor value using for example 
several bits. The allocated numbers are stored in for 
example ROM (Read Only Memory) (not shown). Atthis 55 
point, the alternatives of a scale factor value increment 
by for example 2 dB. A number allocated to a scale fac- 
tor value selected for a particular unit block is defined as 



scale factor information of the particular unit block. 
[0044] An output signal (namely, each value of the 
spectrum SB) of the energy calculating circuit 302 is 
supplied to a convolution filter circuit 303. The convolu- 
tion filter circuit 303 performs a convoluting process for 
multiplying a predetermined weighting function by a 
spectrum SB and adding them so as to consider the 
influence of the masking of the spectrum SB. Next, with 
reference to Fig. 6, the convoluting process will be 
described in detail. As was described above, Fig. 6 
shows an example of a spectrum SB of each block. In 
the convoluting process of the convolution filter circuit 
303, the sum of portions denoted by dotted lines is cal- 
culated. The convolution filter circuit 303 can be com- 
posed of a plurality of delaying devices, a plurality of 
multiplying devices, and a sum adding device. Each of 
the delaying devices successively delays the input data. 
Each of the multiplying devices multiplies output data of 
a relevant delaying device by a filter coefficient (weight- 
ing function). The sum adding device adds the output 
data of the multiplying devices. 
[0045] Returning to Fig. 5, an output signal of the 
convolution filter circuit 303 is supplied to a calculating 
device 304. An allowance function (that represents a 
masking level) Is supplied from an (n - ai) function gen- 
erating circuit 305 to the calculating device 304. The 
calculating device 304 calculates a level a con-espond- 
ing to an allowable noise level in an area convoluted by 
the convolution filter circuit 303 with tiie allowance func- 
tion. As will be described later, the level a con-espond- 
ing to the allowable noise level is an allowable level of 
each critical band as a result of an inversely convoluting 
process. The calculated value of the level a is controlled 
by increasing/decreasing the allowance function. 
[0046] In other words, when the numbers allocated 
from the lowest critical band are denoted by I, the level 
a corresponding to the allowable noise level can be 
obtained by the following fomiula (1 ). 

a = S-(n-ai) (1) 

wherein n and a are constants; a > 0; S is the strengtii 
of a convoluted spectrum. In formula (1), (n - ai) is an 
allowance function. In this example, n » 38 and a = 1 are 
given. 

[0047] The level a calculated by the calculating 
device 304 is supplied to a dividing device 306. The 
dividing device 306 inversely convolutes the level a. As 
a result, the dividing device 306 generates a masking 
spectrum corresponding to the level a. The masking 
spectrum is an allowable noise spectrum. When the 
inversely convoluting process is perfonned, compli- 
cated calculations are required. However, according to 
the first embodiment of the present invention, with the 
dividing device 306 that Is simply structured, the 
inversely convoluting process is perfomied. The mask- 
ing spectrum Is supplied to a combining circuit 307. In 
addition, data that represents a minimum audible curve 
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RC (that wilfbe described later) is supplied from a min- 
imum audibie curve generating circuit 312 to the com- 
bining circuit 307. 

[0048] The combining circuit 307 combines the 
mas)<ing spectrum that is output from the dividing 
device 306 and the data that represents the minimum 
audible curve RC and generates a masl^ing spectrum. 
The generated masl<ing spectrum is supplied to a sub- 
tracting device 308, The timing of an output signal of the 
energy calculating circuit 302 (namely, the spectrum SB 
of each sub band) is adjusted by a delaying circuit 309. 
The resultant signal is supplied to the subtracting device 
308. The subtracting device 308 performs a subtracting 
process corresponding to the masking spectrum and 
the spectrum SB. 

[0049] As the result of the process, the spectrum 
SB of each biocl< is masl<ed so that the portion that is 
smaller than the level of the maslcing spectrum is 
masl<ed. Fig. 7 shows an example of the maslcing proc- 
ess. Referring to Fig. 7, the portion that is smaller than 
the level of the maslcing spectrum (MS) of the spectrum 
SB is masked. For simplicity, in Fig. 7, only the spectrum 
B12 is denoted by "SB" and the level of the masking 
spectrum is denoted by "MS". 
[00501 When the noise absolute level is equal to or 
smaller than the minimum audible curve RC, the noise 
is inaudible for humans. The minimum audible curve 
varies corresponding to the reproduction volume even 
in the same encoding method. However, in a real digital 
system, music data in for example a 16-bit dynamic 
range does not largely vary. Thus, assuming that the 
quantizing noise of the most audible frequency band at 
around 4 kHz is inaudible, it is supposed that the quan- 
tizing noise that is smaller than the level of the minimum 
audible curve is inaudible In other frequency bands. 
[0051] Thus, when noise at around 4 kHz of a word 
length of the system is prevented from being audible, if 
the allowable noise level is obtained by combining the 
minimum audible curve RC and the masking spectrum 
MS, the allowable noise level can be represented as a 
hatched portion shown In Fig. 8. In this example, the 
level at 4 kHz of the minimum audible curve is set to the 
minimum level equivalent to for example 20 bits. In Fig, 
8, SB of each block is denoted by a solid line, whereas 
MS of each block is denoted by a dotted line. However, 
in Rg. 8, for simplicity, only the spectrum B12 is repre- 
sented with "SB", "MS", and "RC". In Fig. 8, a signal 
spectrum SS is denoted by a dashed tine. 
[0052] Returning to Fig. 5, an output signal of the 
subtracting device 308 is supplied to an allowable noise 
compensating circuit 31 0. The allowable noise compen- 
sating circuit 31 0 compensates the allowable noise level 
of the output signal of the subtracting device 308 corre- 
sponding to for example data of an equal roundness 
curve. In other words, the allowable noise compensat- 
ing circuit 310 calculates allocated bits for each unit 
block corresponding to various parameters such as the 
above-described masking and hearing characteristic. 



An output signal of the allowable noise compensating 
circuit 310 Is obtained as the final output data of the bit 
allocation calculating circuit 118 through an output ter- 
minal 311. In this example, the equal roundness curve 

5 is a characteristic curve that represents the hearing 
characteristic of humans. For example, the sound pres- 
sure of a sound at each frequency that is heard with the 
same strength of a pure sound at 1 kHz is plotted. The 
potted points are connected and represented as a 

70 curve. This curve is referred to as roundness equal sen- 
sitivity curve. 

[0053] The equal roundness curve matches the 
minimum audible curve shown in Fig. 8. On the equal 
roundness curve, although the sound pressure at 

15 around 4 kHz is smaller than that at 1 kHz by 8 to 1 0 dB, 
the strength at 4 kHz is the same as that at 1 kHz. In 
contrast, unless the sound pressure at 50 Hz is larger 
than that at 1 kHz by around 15 dB, the strength at 50 
Hz is not the same as that at 1 kHz. Thus, when noise 

20 that exceeds the level of the minimum audible curve RC 
(namely, the allowable noise level) has a frequency 
characteristic corresponding to the equal roundness 
curve, the noise can be prevented from being audible to 
humans. Thus, it is clear that in consideration of the 

25 equal roundness curve, the allowable noise level can be 
compensated con^esponding to the hearing characteris- 
tics of humans. 

[0054] Next, scale factor infonnation will be 
described in detail. As alternatives of a scale factor 

30 value, a plurality of positive values (for example, 63 pos- 
itive values) are stored in for example a memory of the 
bit allocation calculating circuit 1 1 8. Values that exceed 
the maximum value of the absolute values of the spec- 
trum data or MDCT coefficients of a particular unit block 

35 are selected from the alternatives. The minimum value 
of the selected values is used as the scale factor value 
of the particular unit block. A number allocated to the 
selected scale factor value is defined as scale factor 
infomnation of the particular unit block. The scale factor 

40 information is contained in the encoded data. The posi- 
tive values as the alternatives of a scale factor value are 
allocated with numbers of six bits. The positive values 
increment by 2 dB. 

[0055] When the scale factor information is control- 
45 led with an adding operation and a subtracting opera- 
tion, the level of the reproduced audio data can be 
adjusted with an increment of 2 dB. For example, when 
the same values that are output from the normalization 
infonnation changing circuit 119 are added or sub- 
so tracted to/from the scale factor information of all the unit 
blocks, the levels of all the unit blocks can be adjusted 
by 2 dB. The scale factor information generated as the 
result of the adding/subtracting operations is limited to 
the range defined in the applied format 
55 [0056] Alternatively, when different values that are 
output from the normalization information changing cir- 
cuit 1 1 9 are added or subtracted to/from the scale factor 
infonnation of the respective unit blocks, the levels of 
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"the unit blocks can be separately adjusted. As a result, 
a filtering function can be accomplished. In nnore reality, 
when the normalization information changing circuit 1 1 9 
outputs a pair of a unit block number and a value to be 
added or subtracted to/from the scale factor information 
of the unit block, unit blocks and values to be added or 
subtracted to/from scale factor information of the unit 
blocks are correlated. 

[0057] By changing scale fiactor information in the 
above-described manner, functions that will be 
described with reference to Figs. 1 0, 1 1 , and 12 can be 
accompfished. In addition, a digital signal recording 
apparatus that perfonns other than QMF and MDCT 
processes as the sub band coding method and the 
encoding method Is known. For example, when an 
encoding method for performing a quantizing operation 
using normalization information and bit allocation infor- 
mation (for example, a method corresponding to the sub 
band coding method using for example filter banks is 
used, the editing process for changing normalization 
information can be performed. 

[0058] Next, with reference to Fig. 9, an example of 
the structure of a digital signal reproducing and/or 
recording apparatus according to the present invention 
will be described. Encoded data that Is reproduced from 
a record medium such as a magneto optical disc is sup- 
plied to an input terminal 707. In addition, block size 
information used in the encoding process (namely, data 
equivalent to output signals of the output temninals 113, 
1 15, and 1 17 shown In Fig. 1) is supplied to an input ter- 
minal 708. In addition, a normalization infonmation 
changing circuit 709 generates a parameter used for the 
editing process con-esponding to a user's command 
that is input through for example an operating panel (the 
parameter is for example, a value to be added or sub- 
tracted to/from scale factor information of each unit 
block). 

[0059] The encoded data is supplied from the input 
terminal 707 to a calculating device 71 0. The calculating 
device 710 also receives numeric data from a nomnali- 
zation information changing circuit 709. The calculating 
devices adds the numeric data is supplied from the nor- 
malization information changing circuit 1 1 9 correspond- 
ing to supplied scale factor information of encoded data. 
When the numeric value that is output from the normal- 
ization information changing circuit 709 Is a negative 
value, the calculating device 710 operates as a sub- 
tracting device. An output signal of the calculating 
device 710 is supplied to an adaptive bit allocation 
decoding circuit 706 and an output terminal 71 1 . 
[0060] The adaptive bit allocation decoding circuit 
706 references the adaptive bit allocation infomriation 
and deallocates the allocated bits. An output signal of 
the adaptive bit allocation decoding circuit 706 is sup- 
plied to inversely orthogonally transfomiing circuits 703, 
704, and 705. The inversely orthogonally transfomiing 
circuits 703, 704, and 705 transform a frequency-base 
signal into a time-basis signal. An output signal of the 



inversely orthogonally transforming circuit 703 is sup- 
plied to a band combining filter 701. Output signals of 
the inversely orthogonally transforming circuit 704 and 
705 are supplied to a band combining filter 702. Each of 

5 the inversely orthogonally transforming circuits 703, 
704, and 705 may be composed of an Inversely modi- 
fied DCT transforming circuit (IMDCT). 
[0061] The band combining filter 702 combines 
supplied signals and supplies the combined result to the 

10 band combining filter 701. The band combining filter 

701 combines supplied signals and supplies the com- 
bined result to a terminal 700. In such a manner, time- 
base signals of separated sub bands that are output 
from the inversely orthogonally transforming circuits 

75 703, 704, and 705 are decoded into a signal of the 
entire band. Each of the band combining filters 701 and 

702 may be composed offer example an IQMF (Inverse 
Quadrature Min-or Filter). Decoded signals of the entire 
band are supplied to a general configuration for output- 

20 ting the reproduction sound contains D/A converter, a 
speaker or so forth (not shown) via the output tenninal 
700. 

[0062] By operating scale factor information with an 
adding operation or a subtracting operation of the calcu- 

25 lating device 710, the level adjustment of the repro- 
duced data can be performed every for example 2 dB. 
When the normalization infonnation changing circuit 
709 outputs the same value and adds or subtracts the 
value to/from scale factor information of each unit block. 

30 Thus, the level adjustment of each unit block can be per- 
formed for 2 dB. In such a process, scale factor informa- 
tion generated as a result of the adding/subtracting 
operation Is limited in the range of scale factor values 
defined con-esponding to the applied fonnat. 

35 [0063] Alternatively, when the nomializatlon infor- 
mation changing circuit 709 outputs a different value for 
each unit block and adds or subtracts the different value 
to/from scale factor information of each unit block, the 
level adjustment of each unit block can be performed. 

40 As a result, a filter function can be accomplished. In 
reality, the normalization information changing circuit 
709 outputs a set of each unit block number and a value 
to be added or subtracted theretolherefrom. Thus, 
each unit block can be correlated with a value to be 

4S added or subtracted to/from scale factor information. 
[0064] Next, an editing process performed by 
changing scale factor information will be described in 
detail. Fig. 1 0 shows an example of a block floating 
process as a normalizing process affected to encoded 

50 data that is output from the adaptive bit allocation 
encoding circuit 706. In Rg. 10, it is assumed that 10 
normalization levels 0 to 9 are prepared. The maximum 
spectrum data in the individual unit blocks or a normali- 
zation level number corresponding to the minimum nor- 

55 malization level that is larger than MDCT coefficients is 
treated as scale factor information of the cun^ent unit 
block. Thus, in Fig. 1 0, the scale factor information cor- 
responding to the block number 0 is 5, whereas the 
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scale factor* information corresponding to the block 
number 1 is 7. This designation applies to other blocks. 
As was described above with reference to Rg. 3, scale 
factor infonnation is written to encoded data. Generally, 
corresponding to normalization information, data is 
decoded. 

[0065] Rg. 1 1 shows an example of the operation of 
scale factor information shown in Fig. 10. When the nor- 
malization infomnation changing circuit 119 outputs a 
value "-r for all unit blocks and the calculating devices 
120, 121, and 122 add the value "-1" to scale factor 
infonnation as shown in Fig. 1 0, scale factor infomnation 
becomes a value smaller than the original value by "1". 
In such a process, spectrum data or an MDCT coeffi- 
cient of each unit block is decoded as a value that is 
smaller than the original value by 2 dB. In other words, 
the level adjustment is performed so that the signal level 
is lowered by for example 2 dB. 
[0066] Rg. 1 2 shows another example of a process 
performed by the normalization information changing 
circuit 709 for scale factor infonnation contained in 
encoded data. As shown in Fig. 10, when the normali- 
zation information changing circuit 119 output the value 
"-6" for the block of the block number 3 and the value 
4" for the block of the block number 4 and then these 
values are added to scale factor information of the 
blocks of the block numbers 3 and 4, the scale factor 
values of the blocks of the block numbers 3 and 4 
become "0" as shown in Rg. 12. As a result, a filtering 
process is performed. In the example shown in Fig. 12, 
by adding negative values (or subtracting positive val- 
ues) to scale factor values, they become "0". Alterna- 
tively, a scale factor value of a desired block may be 
fon:edlysetto"0". 
. [0067] in the examples shown in Rgs. 10 to 12, the 
number of unit blocks is five (unit block 0 to unit block 4) 
and the number of normalization alternatives is 1 0 (nor- 
malization alternative 0 to 9). However, in the format of 
a real record medium such as an MD (Mini Disc) that a 
magneto optical disc, the number of unit blocks is 52 
(unit block 0 to unit block 51) and the number of normal- 
ization alternatives is 64 (normalization alternative 0 to 
normalization alternative 63). In such a range, by finely 
designating unit blocks and parameters for changing 
scale factor infonnation and so forth, the level adjusting 
process, the filtering process, and so forth can be more 
precisely perfonned. 

[0068] When a recording system is added to the 
structure portion shown in Fig. 9, data recorded on a 
record medium can be rewritten corresponding to an 
edited result The record medium is for example a disc 
shaped record medium (such as an magneto optical 
disc or a magnetic disc), a tape shaped record medium 
(such as a magnetic tape or an optical tape), or a semi- 
conductor memory (such as an IC memory, a memory 
stick, or a memory card). When an edited result is out- 
put through an output temninal 711 shown in Fig. 9 and 
written to a record medium, scale factor information can 



be written to a record medium using such a simple 
structure. Thus, with reference to a reproduced result 
(namely, while listening to a reproduced sound), the 
user or the like can perfonn an editing process and 

5 cause the recording system to rewrite data recorded on 
the record medium corresponding to the edited result. 
Thus, the result of the editing process due to a change 
of normalization information or the like can be stored. In 
addition, a record medium on which the result of the 

10 editing process has been recorded can be provided. 
[0069] As the result of the editing process due to a 
change of scale factor information described with refer- 
ence to Figs. 10 to 12, various functions such as a 
reproduction level adjusting function, a fade-in function, 

15 a fade-out function, a filtering function, and a wowing 
function can be accomplished. However, the level 
adjustment is performed corresponding to at most an 
increase or decrease of one value of nomnalization 
information (for example, 2 dB). In other words, the level 

20 adjustment cannot be performed in the accuracy lower 
than 2 dB. Likewise, in the chronological direction, the 
level adjustment is performed in the encoding data for- 
mat corresponding to the applied format (for example, in 
the accuracy of at most one frame or the like). 

25 [0070] To solve such problems, according to the 
present invention, encoded data is temporarily decoded 
to PCM samples. Thereafter, the PCM samples are 
edited in a desired manner. Thereafter, the edited PCM 
samples are encoded once again. As a result, encoded 

30 data is obtained. However, since each frame of encoded 
data contains data that overlaps with the adjacent 
frames, a process in consideration with the overlapped 
portions is required. This process will be described 
next. As was described above, one frame is composed 

35 of for example 1024 PCM samples. In the processes 
perfonned by the MDCTs 103, 104, and 105, each 
frame that is successively processed has an overlap 
portion of samples. An example of such a process is 
shown in Fig. 13. When 1024 samples that are sample 

40 n to sample n+1023 are processed in a frame N, 1024 
PCM samples that are sample n+512 to sample n+1535 
are processed in a frame N+1, whereas 1024 PCM 
samples that are sample n+1 024 to sample n4-2047 are 
processed in a frame N+2. 

45 [0071] However, in the first frame, it is assumed 
before the sample sequence begins, there are 512 zero- 
data PCM samples as a virtual frame. The first frame is 
processed so that it overlaps with the virtual frame. 
Likewise, in the last frame, it is assumed after the sam- 

50 pie sequence ends, there are 512 zero-data PCM sam- 
ples as a virtual frame. The last frame is processed so 
that it overlaps with the virtual frame. In such a process, 
the number of samples substantially processed is 512. 
[0072] As was descried above, by changing scale 

55 factor infonnation, an editing process can be perfonned 
for each frame. However, in the MDCT process for each 
frame, it is clear that the overlap portion should be con- 
sidered. This point will be described in reality with refer- 
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ence to^^ig. 13. In Fig. 13, PCM samples are denoted 
" as a set of points arranged in the chronological direc- 
tion. When an editing process for changing scale factor 
information for the frame N and the frame N+1 , the level 
adjusting function or the like as an editing process is 5 
accomplished for the PCM samples n+512 to the PCM 
samples n+1 023. However, since the PCM sample n to 
the sample n-i-51 1 arid the PCM sample n+1 024 to the 
PCM sample n+1 535 overlap with adjacent frames that 
have not been edited, the function of the editing process 10 
is not accomplished for these PCM samples. 
[0073] In addition, the level adjustment is performed 
corresponding to an increase or decrease of at most 
one value of normalization information (for example, 2 
dB). In addition, the filter function or the like is restricted is 
with the number of unit blocks of one frame and a fre- 
quency division width corresponding to each unit block. 
In other words, the editing process is restricted corre- 
sponding to the applied encoding method and encoding 
data format 20 
[0074] Fig. 1 4 shows an example of the structure for 
temporarily decoding encoded data, performing an edit- 
ing process for decoded PCM samples, and encoding 
the edited PCM samples once again according to the 
present invention. Encoded data is supplied to a decod- 25 
ing circuit 802 through a temninal 801 . The decoding cir> 
cuit 802 partly decodes the supplied encoded data and 
generates PCM samples. The decoding circuit 802 
partly decodes the encoded data coresponding to a 
command issued by the user or the like through for 30 
example an operation panel. In other words, the user 
can designate a portion of encoded data that is 
decoded by the decoding circuit 802. The decoding cir- 
cuit 802 generates PCM samples and supplies them to 
a memory 803. The memory 803 temporarily stores the 35 
PCM samples. 

[0075] A data modlfing circuit 804 perfonns one of 
various modifing processes as editing processes for the 
PCM samples stored in the memory 803. Examples of 
the modifing processes are a reverb process, an echo 40 
process, a filtering process, a compressor process, and 
an equalizing process. The data modifing circuit 804 
supplies the modified PCM samples to a delay compen- 
sating circuit 805. The delay compensating circuit 805 
performs a delay compensating process for the modi- 45 
fied PCM samples. The compensated PCM samples 
are temporarily stored in a memory 806. An encoding 
circuit 807 perfomns an encoding process for the PCM 
samples stored in the memory 806. The encoding cir- 
cuit 807 outputs the generated encoded data to an out- so 
put terminal 808. Thus, encoded data that has been 
edited can be recorded to a record medium through the 
output terminal 808. 

[0076] Next, the process of the delay compensating 
circuit 805 will be descried in detail. The delay compen- ss 
sating process is a phase adjusting process for conv 
pensating a time lag of the output data of the encoding 
circuit 807 against the encoded data that is input from 



the terminal 801 due to the operation time periods of the 
decoding circuit 802 and the encoding circuit 807. Thus, 
the delay compensating circuit 805 secures the chrono- 
logical relation between a frame that is output from the 
encoding circuit 807 and a frame that Is input from the 
tenninal 801. The delay amount depends on the struc- 
ture of a band dividing filter or a band combining filter 
(for example, the number of banks, an input timing of 
such a filter, the number of zero-data PCM samples, 
and a buffering using windows in the MDCT process). 
[0077] For example, the number of banks of each of 
the band dividing filters 101 and 102 shown in Fig. 1 is 
48. Likewise, the number of banks of each of the band 
combining filters 702 and 701 shown in Fig. 9 is 48. 
When 512 zero-data PCM samples are used for a vir- 
tual frame that overlaps with the first frame, the delay 
amount due to the encoding process and the decoding 
process becomes 653 PCM samples. The delay com- 
pensating circuit 805 may be disposed at any position 
between the output of the decoding circuit 802 and the 
output of the encoding circuit 807. The delay compen- 
sating circuit 805 may have a buffer memory or the like 
for compensating the delay amount. Alternatively, the 
delay compensating circuit 805 may be a timing control- 
ling circuit that controls the memories 803 and 806 so 
that they are accessed at timings in consideration of the 
delay amount. 

[0078] The decoding circuit 802 shown in Fig. 14 
has the structure shown in Fig. 9. On the other hand, the 
encoding circuit 807 shown in Fig. 14 has the structure 
shown in Fig. 1. The structure portion shown in Fig. 14 
temporarily decodes encoded data, performs an editing 
process for the decoded PCM sampies, encodes the 
edited PCM samples, and writes the generated 
encoded data to a record medium. Besides a magneto 
optrcal disc, an example of the record medium may be a 
disc shaped record medium (such as a magnetic disc), 
a tape shaped record medium (such as a magnetic tape 
or an optical tape), or a semiconductor memory (such 
as an IC memory, a memory stick, or a memory card). 
[0079] Next, with reference to Fig. 1 6, the chrono- 
logical relation between the encoded data that Is sup- 
plied through the input temnlnat 801 and the encoded 
data that is output through the output terminal 808. In 
Rg. 16, frames N-1, N, N+1, N+2, and N+3 shown in 
Fig. 16 represent frames in the encoded data that is 
input through the input terminal 801. PCM samples 
decoded from these frames are denoted as a set of 
points that are arranged in the chronological direction. 
The chronological relation of the decoded PCM sam- 
ples does not vary even if the amplitude value of the sig- 
nal shown in Rg. 12 is edited. However, to maintain the 
chronological relation between frames of encoded data 
generated by the encoding circuit 807 and frame of 
encoded data that has not been edited, the delay* for 
653 points should be compensated. 
[0080] When the first frame of encoded PCM sam- 
ples that have been delay compensated is denoted by a 
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frame M-1 , the last 51 2 PCM samples of the frame M-1 
are 512 PCM samples starting from the position of 
which the decoded PCM samples are delayed by 663 
samples. At this point, since the frame M-1 is the first 
encoded frame, the first 512 PCM samples of the frame 
M-1 are zero-data PCM samples. Thereafter, the 
frames M+1, M+2, and M+3 are successively encoded 
and output through the output terminal 808. In this case, 
the frame M-1 corresponds to the frame N-1; the frame 
M corresponds to the frame N; the frame M+1 corre- 
sponds to the frame N+1 ; the frame M+2 corresponds to 
the frame N+2; and the frame M+3 corresponds to the 
frame N+3. 

[0081] In such a relation, to generate PCM samples 
of for example the frame M, it is necessary to decode 
the frames N-1 to N+1 . In other words, to edit a desired 
frame and then encode It, at least one preceding frame 
and one following frame of the current frame are 
required. 

[0082] However, for the frames M-1 , M, and M+1 
that ara output from the output tenninal 808, the relation 
of an overlap should be considered. In other words, in 
the case that a portion e shown in Fig. 1 6 is edited, if the 
frame N is edited and then substituted with the frame M, 
due to the overlap portion with the frame M+1 , a desired 
edit result cannot be obtained. In this case, to obtain a 
desirad edit result, it is necessary to edit the frame N+1 
and then replace the result with the frame M+1 . In this 
case, as was descried above, it is necessary to decode 
the frames N to N+3. 

[0083] In other words, to edit the portion e and 
obtain a desired rasult, the frames N-1 to N+3 are 
extracted and decoded. Thus, PCM samples are gener- 
ated and edited. As a result, the frames M and M+1 are 
obtained and used instead of the frames N and N+1. In 
addition, by considering the chronological relation 
between data generated for obtaining a desired edit 
result and a frame to be decoded for generating PCM 
samples, data for a relatively long time period can be 
edited. In addition, according to the embodiment of the 
present invention, an influence of windows in the 
orthogonal transform is not considered. However, to 
consider it, the editing process can be finely performed. 
[0084] This point will be described practicaiiy with 
reference to Figs. 15A, 15B, and 15C. 
[0085] Rg. ISA shows a signal recorded on a 
record medium. In Fig. 15A, F1, F2, F3, F4, F5, and F6 
denote frames formed on a record medium. Each frame 
is a data record unit. Each frame contains a digital 
encoded signal as represented by a signal waveform. 
[0086] Next, the case of which an effect process is 
perfonned for the frames F3 and F4 shown in Fig. 15A 
will be described. 

[0087] The frames F3 and F4 to which the effect 
process is perfonnned are input to the temriinal 801 
shown in Fig. 14. Thereafter, the frames F3 and F4 are 
supplied to the decoding circuit 802. The decoding cir- 
cuit 802 decodes the frames F3 and F4 and supplies the 



decoded frames to the memory 803. The memory 803 
stores the decoded frames. The digitally decoded sig- 
nals of the frames F3 and F4 stored in the memory 803 
are supplied to the data changing circuit 804. The data 

5 changing circuit 804 performs the effect process for the 
digitally decoded signals of the frames F3 and F4. The 
decoding process and the effect process result in a 
delay D2 as shown in Fig, 15B. In other words, as was 
described above, for the frame F3 as the first frame, 512 

10 zero-data PCM samples are used as a virtual frame that 
precedes the first frame F3. The frame F3 is processed 
so that it overlaps with the virtual frame. When the proc- 
essed results of the frames F3 and F4 are denoted by 
frame DF3 and DF4, respectively, they can be repre- 
ss sented as a part of a waveform having a delay D2. In 
other words, the frames DF3 and DF4 are generated as 
a part of the signal wave of which a zero-data signal is 
filled before the signal wave shown in Fig. 15A starts. 
[0088] When a signal with a delay D1 is encoded by 

20 the encoding circuit 807, as with the case of the decod- 
ing process, the delay D2 takes place. As a part of a sig- 
nal of which the delay D1 and the delay D2 ara added in 
the signal waveform shown in Fig. 15 A, frames DDF3 
and DDF4 are generated. In other words, the frames 

25 DDF3 and DDF4 are generated as a part of the signal 
waveform of which zero-data signal is filled in the period 
of the delay D1 and the delay D2 from the beginning of 
the frame 1 of the record medium. 
[0089] When the frames DDF3 and DDF4 are 

30 rewritten to positions on the record medium correspond- 
ing to the time infomnation of the frames DDF3 and 
D0F4, if the delay compensating process of the delay 
compensating circuit 805 have not been perfonned for 
the frames DDF3 and DDF4, the frame DDF3 is over- 

35 written to the positions of the frames F5 and F6 on the 
record medium. On the other hand, the frame DDF4 is 
ovenivritten to the positions of the frames F6 and F7 on 
the record medium. 

[0090] Thus, the frames Fl, F2, F3, and F4, a part 
40 of the frame F5, the frames DDF3 and DDF4 that have 
been effect processed, and a part of the frame F7 have 
been recorded on the record medium. As a result, the 
continuity of the signal is lost. 

[0091] To solve this problem, the time information of 
45 the generated frames DDF3 and DDF4 is offset by the 
total time period of the delay amounts D1 and D2. Thus, 
the frames DDF3 and DDF4 can be rewritten to the 
positions of the frames F3 and F4 on the record 
medium, respectively As a result the continuity of the 
50 signal is secured. In addition, a record medium contains 
frames that have been effect processed can be pro- 
vided. 

[0092] Next, the case of which a part of encoded 
PCM data recorded on a record medium is decoded, 
55 edited, and then rewritten to the record nredium will be 
described with reference to Figs, 1 7A, 1 7B, and 1 7C. 
[0093] Fig. 17A shows the case that input PCM 
data is filtered with windows and encoded for each 
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Tframe. In this example, the size of each window is the 
same as the size of each frame. In this example, the ' 
size of each window is 1024 samples. 
[0094] For example, a frame N of the input PCM 
data is filtered with three windows W2, W3. and W4 and s 
then combined. 

[0095] When a portion A of the PCM data shown in 
Fig. 17A is encoded, the portion A Is generated with 
frames N-2 and N-1. In addition, PCM data that has 
been filtered with the window W1 and W2 Is used. io 
[0096] Since the portion A is the beginning portion 
of the PCM data, there is only one adjacent frame that 
is one side of the frame N. Thus, null-data should be 
added to a frame corresponding to the first half of the 
window W1 . As a result, one of the two adjacent frames is 
of the portion A is a null-frame. 

[0097] When PCM data shown In Fig. 17A is 
encoded, the frames N-1 , N, N+1 , N+2, and N+5 are 
recorded to the record medium. However, the null-frame 
is not recorded to the record medium. Thus, only the 20 
minimum number of frames that compose the input 
PCM data is recorded on the record medium, in other 
words, frames that are required for the encoding proc- 
ess are not recorded to a record medium. 
[0098] Next, with reference to Fig. 1 7B, the case of 25 
which a part of PCM data that has been encoded and 
. recorded on a record medium as shown in Fig. 17A will 
be described. 

[0099] In this example, a portion EDIT shown in Fig. 
1 7B of PCM data that has been encoded and recorded 30 
on a record medium as shown in Fig. 17A is edited. In 
this case, the frames N, N+1 , N+2, and N+3 should be 
decoded. In the example shown in Fig. 17B, for easy 
. understanding, the frame N-1 is also decoded. 
[0100] When the five frames are decoded, since the 35 
first frame N-1 and the last frame N+3 each have one 
adjacent frame, they canot be decoded. Thus, to 
decode the frames N-1 and N+3, null-frames are used 
as their adjacent frames. The decoded PCM data is 
edited. As was described above, the start position of the 40 
frame N-1 chronologically deviates due to phase delays 
of the null-frame and the number of banks of the filter by 
653 frames. 

[0101] When the portion EDIT of the decoded PCM 
data is edited, it is clear that the wavefonn correspond- 45 
ing to the data recorded on the record medium is differ- 
ent from the waveform of the edited portion. 
[0102] The reason why the waveform of the second 
half of the frame N+3 is different from the wavefomn cor- 
responding to the data recorded on the record medium so 
is in that when the second half of the frame N+3 is 
decoded, the null-frame is used instead of the first half 
of the frame N+4. 

[0103] On the other hand, since the frame N-1 is 
encoded using a null-frame, when the frame N-1 is 55 
decoded, the waveform of the PCM signal decoding 
using the null-frame is the same as the wavefomi of the 
input PCM signal. 



[01 04] It is necessary to rewrite the edited PCM sig- 
nal to the relevant frame positions on the record 
medium. 

[0105] At this point, when the PCM signal is 
encoded with the same widows shown in Rg. 17A 
(namely, the windows W1, W2. W3, and so forth), 
these windows deviate by the delay in the decoding 
process. 

[0106] To soh^e this problem, when a signal is fil- 
tered with new windows W11, W12, W12, W13, ... and 
W16 as shown in Fig. 178, a signal with the same 
chronological relation as that shown in Fig. 17A can be 
obtained. 

[0107] Thus, it can be said that the window W11 
shown in Rg. 178 corresponds to the windows W1 
shown in Fig. 17A; the window W12 shown in Rg. 17B 
corresponds to the window W2 shown in Fig. 17A; and 
the window W13 shown in Fig. 1 78 corresponds to the 
window W3 shown in Fig. 17A. 
[01 08] As a result, when the filtering positions using 
windows are moved corresponding to the delay com- 
pensation amount as shown in Rg. 17C, the encoded 
frames N, N+1, and N+2 can be rewritten to the frame 
positions corresponding thereto on the record medium. 
[0109] According to the first embodiment and the 
second embodiment of the present invention, in a com- 
bination of MDCT, band division considering the hearing 
characteristics of humans, and bit allocations of individ- 
ual sub bands, a nonnalizing process and a quantizing 
process are performed In each sub band for encoded 
data con-esponding to a highly efficiently encoding 
method. Alternatively, the present invention can be 
applied to another encoding method such as an encod- 
ing data format corresponding to the MPEG audio 
standard. Rg. 18 shows an encoding data format con-e- 
sponding to the MPEG audio standard. 
[0110] The header is composed of 32 bits (fixed 
length). The header contains information of a synchro- 
nous word, an ID, a layer, a protection bit, a bit rate 
index, a sampling frequency, a padding bit, a private bit, 
a mode, a copyright protection slate code, an origi- 
nal/copy representing code, an emphasis, and so forth. 
The header is followed by optional error check data. The 
error check data is followed by audio data. Since audio 
data contains ring allocation information and scale fac- 
tor infomnation along with sample data, the present 
invention can be applied to such a data format. 
[0111] As normalization information, other than 
scale factor information may be used corresponding to 
the encoding method. In such a case, the present inven- 
tion can be applied. 

[01 12] According to the present invention, encoded 
data that is temporarily formed corresponding to for 
example a digital audio signal is partly decoded, edited, 
and then encoded once again. Thus, restrictions due to 
the level adjustment width, the filter function, and the 
chronological process can be suppressed in the editing 
process. Thus, data can be more finely edited. 
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[0113] [Having described a specific preferred 
embodiment of the present invention with reference to 
the accompanying drawings, it is to be understood that 
the invention is not limited to that precise embodiment, 
and that various changes and modifications may be 
effected therein by one skilled in the art without depart- 
ing from the scope or the spirit of the invention as 
defined in the appended claims. 

Claims 
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A digital signal processing apparatus for processing 
an input digital signal that has been segmented as 
blocks each having a predetermined data amount 
and highly efficiently encoded along with adjacent 
blocks, comprising: 

decoding means for decoding the highly effi- 
ciently encoded digital signal along with adja- 
cent blocks; 

modrfing process means for modifing the 
decoded digital signal; 

encoding means for highly efficiently encoding 
the modified digital signal along with adjacent 
blocks; and 

delay compensating means for compensating a 
delay of the decoded signal decoded by said 
decoding means. 

The digital signal processing apparatus as set forth 
in claim 1, 

wherein said encoding means has: 

band dividing means for dividing the input dig- 
ital signal into a plurality of frequency band 

components; 

block segmenting and encoding means for seg- 
menting a sequence of samples of which the 
plurality of frequency band components is 
ananged in chronological direction and/or fre- 
quency direction into blocks and encoding the 
blocks; 

nomnalization processing means for nomnaliz- 
ing each block that has been encoded by said 
encoding means and generating normalization 
information; 

quantized coefficient calculating means for cal- 
culating a quantized coefficient that represents 
the feature of each block of the signal compo- 
nents; 

bit allocating means for deciding the bit alloca- 
tion amount of each block con-esponding to the 
quantized coefficient calculated by said quan- 
tized coefficient calculating means; and 
encoded data generating means for re-quantiz- 
ing each block corresponding to the normaliza- 
tion information generated by said 
nomnalization processing means and the bit 
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allocation amount allocated by said bit allocat- 
ing means and generating encoded data corre- 
sponding to a predetermined format 

The digital signal processing apparatus as set forth 
in claim 1 , 

wherein said decoding means decodes the 
digital signal con^esponding to the infomnation com- 
pressed parameter for each block. 

The digital signal processing apparatus as set forth 
in claim 1 , further comprising: 

operating means for allowing the user to desig- 
nate a highly efficiently encoded digital signal 
to be edited. 

The digital signal processing apparatus as set forth 
in claim 1 , 

wherein the input signal that has been highly 
efficiently encoded is read from a record medium. 

The digital signal processing apparatus as set forth 
in claim 5, 

wherein a delay of the digital signal that has 
been highly efflcientiy encoded by said encoding 
means is compensated by said delay compensating 
means and then the compensated signal is written 
to the record medium so that the phase of tiie com- 
pensated signal matches the phase of the digital 
signal that has been read from the record medium. 

A digital signal processing method for processing 
an input digital signal that has been segmented as 
blocks each having a predetermined data amount 
and highly efficiently encoded along with adjacent 
blocks, comprising the steps of: 

(a) decoding the highly efficiently encoded dig- 
ital signal along with adjacent blocks; 

(b) modifing the decoded digital signal; and 

(c) highly efficiently encoding the changed dig- 
ital signal along with adjacent blocks and com- 
pensating a delay of the decoded signal 
decoded at step (a). 

The digital signal processing method as set forth in 
claim 7, 

wherein step (a)' has the steps of: 

(d) dividing the input digital signal into a plural- 
ity of frequency band components; 

(e) segmenting a sequence of samples of 
which the plurality of frequency band compo- 
nents is arranged in chronological direction 
and/or frequency direction into blocks and 
encoding the blocks; 

(f) nomnatizing each block that has been 
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Isncoded by said encoding means and generat- 
ing normalization information; 

(g) calculating a quantized coefficient that rep- 
resents the feature of each block of the signal 
components; 5 

(h) deciding the bit allocation amount of each 
biocl< corresponding to the quantized coeffi- 
cient calculated at step (g); and 

(i) re-quantizing each blocl< corresponding to 
the nonnalization Infomnatlon generated at io 
step (f) and the bit allocation amount allocated 

at step (h) and generating encoded data corre- 
sponding to a predetemnined format. 

9. The digital signal processing method as set forth in is 
claim 7, 

wherein step (a) is performed by decoding 
the digital signal corresponding to the infomnation 
compressed parameter for each block. 

20 

10- The digital signal processing method as set forth in 
claim 7, further comprising the step of: 

(j) allowing the user to designate a highly effi- 
ciently encoded digital signal to be edited. 25 

11. . The digital signal processing method as set forth in 
claim 7, 

wherein the input signal that has been highly 
efficientiy encoded is read from a record medium. 30 

12. The digital signal processing method as set forth in 
claim 1 1 . 

wherein a delay of the digital signal that has 
• been highly efficiently encoded at step (a) is com- 
pensated at step (c) and then the compensated sig- 
nal is written to the record medium so that the 
phase of the compensated signal matches the 
phase of the digital signal that has been read from 
the record medium. 
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Fig. 15A 
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